SUMMARY


In contemporary society many resources are devoted to skill training.
There is a large body of psychological research aimed at determining the
optimal training procedures. However, relatively few of these studies have
examined truly long-term retention of skills. The present paper reviews the
existing literature on long-term retention and also examines a number of
psychological mechanisms which have been demonstrated to be crucial in studies
of relatively short-term learning and should have important implications for
long-term retention. For example, on the basis of recent research in cognitive
psychology, it is conjectured that maintenance of a skill will depend crucially
on whether that skill is automatic; that is, can be performed without conscious
awareness. Some of the additional mechanisms, including intratask interference
and transfer-appropriate processing, point to the importance of the
relationship between original learning and final test conditions.
PREFACE


This paper presents a review of the literature on skill maintenance as
well as a theoretical analysis of that subject. This paper is meant to
provide the framework with which we could begin a program of research on the
maintenance of skill. A companion piece (Healy et al., 198?), separately
published, provides some sample methodologies we have developed to investigate
this topic. We wish to acknowledge the help of Antoinette Gesi and Debbie
Aguiar in the preparation of this paper.
SKILL MAINTENANCE: LITERATURE REVIEW AND THEORETICAL ANALYSIS


In contemporary society, enormous resources are devoted to education and
skill training. The average person spends about 12 years in school, followed by
either more formal education, military training, or technical training. Given
the magnitude of time and resources devoted to the learning of knowledge and
skills, there is considerable merit in efforts to ensure the effectiveness of
such training. Regardless of the skill domain, the goal of training is to
develop a high level of performance on some category of jobs or tasks, ensuring
that the requisite skills are available for application as needed subsequently
in the field. Psychological research has provided an extensive database on
which effective training procedures can be designed. The generality of this
body of research is, however, limited by the fact that relatively few studies
have examined truly long-term retention.

The purpose of this paper is to review the available literature on
long-term retention. Further, we will examine a number of psychological
mechanisms which may have important implications for long-term retention.

Review  of  Studies  of  Skill  Retention 


Retention of Knowledge

The most important recent work in this area has been conducted by Bahrick
(1979, 1984) and, therefore, this work will be reviewed in some detail. Bahrick
(1979) investigated the acquisition and maintenance of various forms of
knowledge over retention intervals considerably longer than those typically used
in laboratory experiments. He developed two different methods to evaluate the
extent to which maintenance of knowledge depended on periodic access to the
learned material. The first method involves a cross-sectional statistical
procedure. The use of this method requires the availability of a large number
of subjects who acquired the same knowledge at different times in the past. It
is also essential that the subjects be able to estimate the amount of rehearsal
of that knowledge during the interval between original acquisition and the
retention test. The degree of original acquisition of the material for each
subject must also be obtainable. Subjects are then assigned to groups depending
on when they acquired the information, and a retention function is calculated
based on their test performance. This retention function is then corrected,
using multiple regression techniques, for factors contributing to the original
level of performance and the extent of rehearsal or practice during the
retention interval. Bahrick used this technique to study the maintenance of
knowledge about the names and spatial locations of buildings and streets within
a university city. He concluded from his results that spatial information was
lost more rapidly than information about names but that the two types of
information could be recovered with equal ease. Specifically, on the average,
7-8 visits per year were necessary to maintain spatial information about street
sequence and 8-9 visits per year were necessary for maintaining the street names
at the level of graduating seniors. Trade-off estimates were also reported
evaluating the relative importance of the recency, duration, and frequency of
the visits when the knowledge was rehearsed or refreshed.

In a similar cross-sectional study, Bahrick (1984) administered a large
battery of tests of knowledge of Spanish to more than 700 participants whose
last exposure to a course in Spanish ranged from 0 to almost 50 years.
Participants were also selected and grouped according to their final level of
training in Spanish. Questionnaires concerning the amount and recency of
practice and exposure to Spanish were administered to the participants in order
to obtain estimates of the amount of rehearsal necessary to maintain knowledge
over time. Rehearsal levels during the retention interval were found to be
quite low and unrelated to the level of knowledge retention. Reliable
predictors of long-term retention were training level, mean grade in Spanish
courses, and the level of training in other Romance languages. The retention
function over 50 years indicated that memory level dropped for about 6 years to
a stable asymptote and then remained constant for about 50 years, at which time
it again started to drop (possibly because of neurological deterioration
attributable to old age). Higher levels of original training and higher mean
grades were independently associated with overall higher retention. Because
very little rehearsal was reported by the participants, it appeared that much of
the information originally learned was retained in a "permastore" and that it
remained accessible for many years without periodic maintenance activities.
Because the rehearsal predictors were unrelated to retention in this study,
Bahrick concluded that most of what influences the amount of long-term retention
is determined by acquisition processes and not by rehearsal effects during the
retention interval. This conclusion must be viewed as tentative, however, since
it is not known whether greater amounts of rehearsal during the retention
interval would have increased long-term retention. This rich method is
particularly attractive for investigating skill maintenance because it is
applicable to skills learned outside the laboratory under ecologically realistic
conditions.

The second method, developed by Bahrick (1979) to study the maintenance of
knowledge, is conducted solely in the laboratory and involves successive
relearning sessions. In the original learning session, the information to be
acquired is tested and repeated with a dropout technique which ensures that
every item receives the same number of correct responses. Subsequent relearning
sessions start with a test of all the items originally learned and then continue
with the dropout technique used in the original learning session. Among other
materials, Bahrick applied this technique to the learning of English-Spanish
word pairs. He systematically varied the intersession interval (from 0 to 30
days) and examined performance during the original learning session, in the two
or five subsequent relearning sessions, and at a final test session which
followed 30 days after the last relearning session. He found that the
information was maintained at a high level across lengthy intersession
intervals, and that performance in the final test session depended more on the
earlier intersession intervals than on the level of performance reached in the
last relearning session. Performance in the last relearning session was
greatest when the intersession intervals were the shortest. But performance in
the final test session was greatest when the earlier intersession intervals
were the longest so that they matched the interval between the last relearning
session and the final test session. Bahrick concluded from these results that
for optimum maintenance of knowledge, practice should be spaced at intervals not
much shorter than the interval separating practice from test.
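
The dropout technique described above can be sketched in a few lines of
Python. This is only an illustration under stated assumptions, not Bahrick's
actual procedure: the function name dropout_session, the recall callback, the
max_trials guard, and the item labels are all hypothetical. The sketch shows
the one property the text relies on, namely that an item stays in rotation
until it has received a fixed number of correct responses, so that every item
ends the session with the same count.

```python
from collections import deque


def dropout_session(items, recall, criterion=1, max_trials=1000):
    """Run one learning session with a dropout rule (illustrative sketch).

    items      -- iterable of item labels (hypothetical word pairs)
    recall     -- callable(item, n) -> bool; True if the item is answered
                  correctly on its n-th test trial (assumed learner model)
    criterion  -- number of correct responses after which an item drops out
    max_trials -- safety guard so a learner that never succeeds cannot loop forever

    Returns a dict mapping each item to the number of test trials it needed.
    """
    correct = {item: 0 for item in items}
    trials = {item: 0 for item in correct}
    queue = deque(correct)
    total = 0
    while queue:
        if total >= max_trials:
            raise RuntimeError("criterion not reached within max_trials")
        item = queue.popleft()
        trials[item] += 1
        total += 1
        if recall(item, trials[item]):
            correct[item] += 1
        if correct[item] < criterion:
            queue.append(item)  # not yet at criterion: keep the item in rotation
    return trials


# Hypothetical learner: each item is first answered correctly on a fixed trial.
difficulty = {"pair_a": 1, "pair_b": 3, "pair_c": 2}
print(dropout_session(difficulty, lambda item, n: n >= difficulty[item]))
```

Harder items receive more presentations, but all items leave the session with
exactly `criterion` correct responses, which is the equating the text
describes.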

The most important point of this study with respect to the maintenance of
skills is that people who exhibit the same level of skill proficiency
immediately after training can differ substantially on long-term retention,
depending on their previous training history. From this result, we must reject
the intuitive notion that criterion performance at the end of training alone is
a sufficient predictor of long-term retention. Based on studies of intratask
interference, Battig (1979) provided strong convergent evidence for this
viewpoint, and his research efforts will be discussed below.

Intratask Interference

On the basis of an earlier extensive review of the literature, Battig
(1979) proposed a general intratask interference principle of memory: Greater
difficulty or interference at the time of learning produces higher levels of
subsequent long-term retention and transfer of what has been learned. There
have been enough different demonstrations of this phenomenon, including those
cited in Battig's review and others which have since been published (e.g., Shea
& Zimny, 1983), to conclude that intratask interference is a potent contributor
to permanence in memory.

We cite in Table 1 twelve articles that illustrate the methods and results of
previous research on intratask interference. This list is by no means
exhaustive. The reader is referred to Battig's (1979) original article for a
much more extensive bibliography. The first thing to note is that whereas these
studies represent a wide range of problems and procedures, they tend to center
on verbal tasks. Only the studies by Hiew (1977), using concept formation
problems, and Shea and Morgan (1979), using a serial motor response task, fall
outside the strictly verbal domain. Thus, one concern of research is the
generality of this principle to motor skills tasks and tasks that have a greater
perceptual or problem-solving component.

Table 1 lists these illustrations of intratask interference effects. Each
listing includes a brief description of the interference-producing manipulation
within each experiment, followed by three numerical values, the first two of
which represent percent of maximum retention or transfer under low and high
interference conditions. Due to marked variations in overall performance levels
across the various conditions of experiments, a derived measure is given in
the final column, which represents the difference between the high and low
values divided by the maximum possible difference (100%, or perfect performance,
minus the low value) times 100. In other words, %MAX represents the percentage
of maximum facilitation found under the high interference conditions, so that
larger %MAX values are indicative of greater magnitudes of the intratask
interference phenomenon.
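
As a worked illustration of the derived measure, the verbal definition above
can be written as %MAX = (high - low) / (100 - low) x 100. The short Python
sketch below applies that formula to three of the low/high pairs listed in
Table 1. The function name pct_max and the ceiling parameter are assumptions
introduced here for illustration, and the sketch covers only the case the text
describes, in which scores are percentages with 100 as perfect performance.

```python
def pct_max(low, high, ceiling=100.0):
    """Percentage of the maximum possible facilitation under high interference.

    low, high -- percent retention or transfer under low and high interference
    ceiling   -- perfect performance (100 percent, as stated in the text)
    """
    if not 0 <= low < ceiling:
        raise ValueError("low must lie in [0, ceiling)")
    return (high - low) / (ceiling - low) * 100.0


# Low and high values for three experiments as listed in Table 1.
rows = {
    "Nitsch (1977)": (34.1, 79.5),
    "Underwood & Lund (1979)": (58.5, 77.3),
    "Pellegrino (1972)": (79.9, 90.9),
}
for name, (low, high) in rows.items():
    print(f"{name}: %MAX = {pct_max(low, high):.1f}")
```

Dividing by the room left for improvement, rather than reporting the raw
difference, is what lets experiments with very different baseline levels be
placed on a common scale.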

This listing of experiments is intended primarily to document the wide
range of experimental evidence consistent with Battig's principle. It should be
obvious from even a cursory examination of the various experimental
manipulations employed in these experiments that any attempt to summarize or
make meaningful comparisons across experiments is premature. Nonetheless, a few
tentative statements about these results are possible. First, intratask
interference effects are generally sizable, averaging over one-third of the
maximum facilitation that would be possible for high relative to low
interference conditions. Second, these studies appear to define two distinct
bases for interference, viz., similarity of the items composing the learning
unit and variations in the context in which the learning unit appears. For
example, Experiments Number 1, 2, 3, and 4 in Table 1 represent variations
primarily in the context in which any given item to be learned is presented for
study or test. In contrast, Experiments Number 5, 6, 7, and 8 appear to qualify
as manipulations primarily of the degree to which items constituting a verbal
list or some other set of material to be learned are similar to one another.
Although none of these demonstrations included independent manipulations of both
the learning unit and its context, Experiments Number 9, 10, 11, and 12 combine
these factors.

Table 1. Twelve Selected Experiments on Intratask Interference

                                                     Percentage Retention
                                                         or Transfer
    Description of Experiment                          Low       High      %Max

Contextual Variation
 1. Battig (1972) Separation vs. mixing of paired-
    associate pairings and recall tests               40.3      60.4      23.4
 2. Hiew (1977) Blocked vs. mixed training on
    sets of conceptual rules                          15.3       8.5      44.4
 3. Nitsch (1977) Constant vs. varied exemplars
    of verbal contexts                                34.1      79.5      68.9
 4. Underwood & Lund (1979) Simultaneous vs.
    sequential list learning                          58.5      77.3      45.3

Similarity of Items
 5. Chiesi (1976) Paired-associate response
    conceptual similarity                             50.3      63.2      26.0
 6. Pagel (1973) Paired-associate stimulus formal
    similarity (also stimulus meaningfulness)          8.3      23.8      16.9
 7. Pellegrino (1972) Paired-associate response
    formal similarity                                 79.9      90.9      54.7
 8. Posnansky (1974) Serial intralist similarity      54.6      95.0      89.0

Combined Manipulations
 9. Einstein (197?) Extraneous processing of
    related words during free recall learning         52.0      60.7      34.3
10. Johnson (1964) Shape-label similarity
    and congruence                                    31.9      42.2      15.1
11. Schab (1975) Presentation of related paired-
    associate lists on alternate trials               57.1      71.4      33.3
12. Shea & Morgan (1979) Blocked vs. random trial
    sequences in motor skills task                     4.3       2.1      51.5

    Mean                                                                  34.9

Comparisons of the %MAX values in the last column of Table 1 show
substantial magnitudes of intratask interference for all three classes of
experiments. The evidence available fails to give us clear indications,
however, of the relative magnitudes of effects attributable to contextual
variation or to similarity of the items in the learning unit, nor does it tell
us whether a combination of manipulations would produce additive or some other
systematic effects. These experiments argue strongly for the need to produce
separable systematic variation in both learning unit similarity and contextual
variety.

From the foregoing review, intratask interference effects can be conceived
as a consequence of (a) greater amounts, elaboration, or distinctiveness of
processing and the resulting encodings of items, required under conditions of
high interference or difficulty in order to achieve satisfactory short-term
performance and/or (b) the development of encodings that are resistant to
interference and thus more likely to be remembered under the changed contextual
conditions that typically characterize delayed retention or transfer tests.
These results imply that whether or not the intratask interference phenomenon
will be found is heavily dependent upon degree of encoding-retrieval congruence
(the extent to which the memory or transfer tasks are consistent with those of
the original learning situation). This, of course, follows directly from
Tulving's Encoding Specificity Principle (Tulving & Thomson, 1973), as well as
from the results of classical research on intertask transfer showing that any
substantive change from the original learning or encoding conditions typically
produces marked decrements in memory or transfer (Battig, 1979).

There is a further implication of the research described above that changes
in the memory or transfer conditions under which intratask interference is
evaluated should show very different magnitudes of effect, depending on the
specific sources of acquisition interference or difficulty. It is this line of
reasoning that forms the basis for the distinction between variety of the
context and similarity in the learning unit as separable sources of the
phenomenon. Contextual variety refers to changes over repeated trials (or
encounters) in the processing context under which a given task must be learned.
Since the primary effects of greater contextual variety should be to produce
more elaborate and distinctive encoding, such contextual variety should lead to
greater resistance to the normally negative effects of changes at the time of
retention or transfer tests. Indeed, Shea, Hunt, and Zimny (1985) reported that
the verbal protocols of subjects trained on a serial motor skill under
contextual variability contain many more references to distinctive and
contrastive encodings than do those of subjects trained under low variability.
Stated another way, encoding specificity can most effectively be overcome if the
original encodings have taken place under high contextual variety. Contextual
variety, however, necessarily implies at least some noncorrespondence between
the acquisition and the retention or transfer testing conditions, and thus
should be relatively less effective when these conditions are maximally alike.
Consequently, increased contextual variety should result in the intratask
interference phenomenon primarily when context conditions are changed from the
acquisition to the retention or transfer test, with the phenomenon reduced or
eliminated when acquisition conditions are maintained on the later test.

Similarity among the items of the learning unit is a source of acquisition
interference that induces additional processing consisting primarily of the
formation of organizational and discriminative transformations appropriate to
the specific task requirements. Such additional processing, as is induced by
item similarity, should be effective primarily under those specific similarity
conditions, and is likely to be of little or no value if the learning unit is
markedly changed. This leads directly to the hypothesis that increased item
similarity should result in better transfer or retention primarily when the
learning unit is the same for retention (or transfer) as under acquisition, and
the transfer effects should be systematically reduced or eliminated with tests
which incorporate major changes from the original acquisition conditions.

Thus, we have one type of intratask interference (contextual variety) which
should produce positive effects on transfer and retention primarily under
changed retention-transfer conditions, and another (learning unit similarity)
which should result in positive effects mainly where there is minimal change
from the acquisition to the retention or transfer conditions. Consequently, a
combination of both increased contextual variety and learning unit similarity
should produce a greater positive effect over a wider range of retention or
transfer testing conditions than either of these alone. We can hypothesize,
therefore, that intratask interference representing a combination of learning
unit similarity with contextual variety should produce a greater positive effect
on transfer and retention than should either of these factors alone, under
either identical or changed conditions between original acquisition and
subsequent testing.


Retention  of  Motor  Skills 


Practice methods. There is considerable interest in the realms of both
[...] have on retention. In the study of these two practice methods, a curious
paradox surfaced. As reviewed above, Battig, as early as 1966, found that in
verbal learning tasks, contextual interference, caused by randomly ordering the
subject's practice regimen, tended to hamper the acquisition of a task, yet
facilitate its retention. This paradox was also found to be present in motor
learning tasks (Shea & Morgan, 1979). Shea and Morgan's task, however,
confounded practice schedule effects (blocked vs. random) with reaction-time
paradigm effects (simple, inherent in blocked practice, vs. choice, inherent in
random practice).


With this problem in mind, Lee and Magill (1983) devised three experiments
to avoid these confounding effects yet came up with the familiar paradoxical
results. The task used was a hinged barrier course similar to that used by Shea
and Morgan. Possible reaction-time confounding was eliminated by using cued and
uncued practice trials and also by adding a third serial practice group. The
previous findings of Battig (1966) and Shea and Morgan (1979) were upheld in all
three experiments, with both serial and random practice outperforming blocked
practice.
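
The three practice schedules compared in these experiments differ only in how
the same set of trials is ordered. The Python sketch below makes that
difference concrete. It is a minimal illustration, not a reconstruction of the
actual designs: the task labels, the number of repetitions, and the function
names are hypothetical, and the random schedule here is an unconstrained
shuffle, whereas the original studies may have placed limits on repetitions.

```python
import random


def blocked(tasks, reps):
    """All repetitions of one task are completed before the next task begins."""
    return [task for task in tasks for _ in range(reps)]


def serial(tasks, reps):
    """Tasks alternate in a fixed, predictable cycle."""
    return [task for _ in range(reps) for task in tasks]


def randomized(tasks, reps, seed=None):
    """The same trials as the blocked schedule, in an unpredictable order."""
    order = blocked(tasks, reps)
    random.Random(seed).shuffle(order)
    return order


tasks = ["A", "B", "C"]  # hypothetical movement patterns
print("blocked:", blocked(tasks, 2))
print("serial: ", serial(tasks, 2))
print("random: ", randomized(tasks, 2, seed=1))
```

Every schedule contains the same number of trials per task. In the serial and
random orders, successive trials rarely repeat the same task, which is the
feature these two schedules share and the blocked schedule lacks.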

The suggestion here is that a substantially weaker memory results from
blocked practice compared to random and serial. The reason for this inferior
memory might be the relative predictability or unpredictability of the
practice schedule. In a predictable (blocked) regimen, the subject relies
primarily on the predictability as the prime reinforcer, whereas in an
unpredictable (random) regimen, the subject must utilize other sources of
information to accomplish the task. With predictability removed, as in
retention trials, the random practice group will excel and the blocked practice
group, without their "crutch," will flounder. This explanation for the locus of
contextual interference is viable and is right in line with Schmidt's (1975)
[...] constant or variable knowledge of results (KR) theory as a predictor of
motor task retention. Here, KR represents feedback from the experimenter as to
how close the subject came to achieving the task goal. This theory predicts that
subjects receiving KR on every trial, or on a constant schedule, come to depend
too heavily on this source of feedback for learning. Subjects who do not
receive KR on every trial, but rather on a variable schedule, are forced to
utilize other forms of feedback (e.g., visual, auditory, kinesthetic).
Consequently, those groups that have learned to use forms of feedback other than
KR perform better in retention tasks when KR has been withdrawn.

Schema theory. In assessing skill retention, one person's performance or
learning (as measured by an appropriate transfer test) is usually compared to
another person's performance or to some standardized performance criterion.
Unfortunately, such comparisons provide no information as to what training
variables lead to performance differences. Using different types of training
equipment, for example, may produce different learning outcomes. But more
importantly to those in the business of training, given the same training
equipment, which training methods result in superior retention?

One important issue concerning type of training is whether training should
be constant or variable. Some (e.g., Adams, 1971) maintain that constant
training or overpractice of a criterion movement leads to stronger memory and
perceptual traces, thereby ensuring that later the learner will be able to
consistently and accurately perform the criterion movement. Others (e.g.,
Schmidt, 1975) maintain that variable practice or training is more effective and
invoke a schema theory to support their claim. According to Schmidt, in
learning a movement skill, an individual stores different types of information
(e.g., initial conditions, response specifications, sensory consequences, and
response outcomes) and uses this information to form a relational schema, or
cognitive representation of the task. Later, this schema can be used to produce
the required movements in a task. The larger or more diverse the base of
information forming the schema, the stronger are the informational
relationships, the schema itself, and hence, the probability of producing the
criterion task. To achieve this larger or more diverse base of information,
Schmidt recommends increasing the variability of training or practice around the
criterion movement, an approach opposite to that recommended by Adams of
practicing the criterion movement only.

Long-term retention of motor skills. Because comprehensive literature
reviews on the long-term retention of motor skills are available (e.g., Naylor &
Briggs, 1961; Schendel, Shields, & Katz, 1978), this section will focus on
summarizing the major findings of this field and relating them to the retention
of perceptual and cognitive skills.

Schendel et al. (1978) have concluded that the level of original training
is the single most important determinant of motor skill retention. This general
conclusion was also reached by Bahrick (1984) in his study on the long-term
retention of knowledge concerning Spanish. Additionally, mastery training
(overlearning) can increase long-term retention of motor skills beyond the level
attained with proficiency (criterion) training. Mastery training has also been
shown to counteract the negative effect of high levels of arousal or anxiety, an
important consideration for skills needed in emergency situations.

Refresher training and mental rehearsal have been shown to aid the
long-term retention of motor skills. The effect of re-exposure to a skill not
only reinforces knowledge previously learned but can also produce new learning.
Although Bahrick (1984) did not find evidence for an effect of rehearsal on the
retention of the Spanish language, the levels of rehearsal reported may have
been too low to produce an enhancement of retention.

The individual ability of subjects in a training program can be a good
predictor of speed of learning and retention. That is, subjects with high
ability will learn a skill to a performance criterion faster and retain it at a
higher level than will subjects with lower ability levels. It should be noted,
however, that the better retention of high ability subjects may be due to
differences in the degree of original learning, and that if all subjects are
trained to the same degree, long-term retention may be unrelated to initial
ability differences.

An important consideration in the evaluation of skill retention is the
manner in which it is measured. If retention is measured from the first
performance of a skill at the end of the retention interval, subjects will, in
general, show poor retention. However, performance improves quickly after the
first retention testing and if retention is indexed by the speed of relearning
to criterion performance, subjects will show a higher level of retention. In
other words, after a long period of disuse, skill performance will initially be
"rusty," but after a short period of practice, performance will return to its
original level. Thus, an important consideration for retention training is
whether or not a period of relearning is feasible. If relearning is not
feasible, as in emergency situations, then a more intensive retention training
program should be considered to ensure a high level of performance in the first
execution after a long retention interval.

The marked difference between retention measured from the first re-exposure
and the speed of relearning forces us to qualify the concept of permastore.
Bahrick's conception of permastore refers to the former measure of retention;
that is, the information which can be accessed without any relearning. Thus,
permastore represents the "tip of the iceberg" of overall retention because much
of what is retained manifests itself only after some relearning.

Schendel et al. (1978) concluded that the distribution of practice (massed
vs. spaced) does not appear to influence the retention of motor skills. This
finding is in marked contrast to those from studies showing strong effects of
practice distribution in many cognitive skills and, in particular, to the work
of Bahrick (1979). A possible explanation for this discrepancy pertains to the
conclusion of Bahrick that the spacing of learning sessions should be as long as
the retention interval. Many of the studies cited by Schendel et al. as not
showing a practice distribution effect for motor skills compare immediate trial
repetitions to ones spaced only seconds apart. Retention was then tested after
a relatively long retention interval (e.g., 1 month). Perhaps the spacing of
the learning trials was too short relative to the retention interval to produce
a detectable difference in retention. Further research is needed to examine
this possibility before practice distribution is discarded as a consideration in
a skill retention program.

Discovery  Learning 

Are procedures better retained or more likely to appear in permastore after
discovery or after expository learning? Which method of learning would provide
for better transfer of knowledge to new problems? These are questions which
address the long-debated issue of whether discovery or expository learning is
better (i.e., makes information easier to retrieve from memory and to transfer
to new situations). Bruner (1961) argued that discovery learning is beneficial
for a variety of reasons, such as the fact that discovery causes the learner to
organize the material, which would lead to better retrievability than if the
material were rote memorized. Also, discovery would allow the learner to form
a hierarchically higher-level conceptualization of the material; it also might
cause the learner to find alternate solutions to a problem rather than merely
the "right way to do it," which would provide flexibility in problem-solving
ability and would aid in transfer to different situations. Friedlander (1965),
on the other hand, provided a number of arguments against the superiority of
discovery learning. He noted that there are skills and facts which must be
memorized before a larger body of knowledge can be learned. Skills and facts,
after being memorized, may yield higher-level conceptualizations. Also,
discovery learning usually takes more time than does expository learning, and
the learner may be led to wrong or inappropriate conclusions.


Richard Mayer, James Greeno, and their colleagues (see Mayer, 1975, for a
review) attempted to test this question systematically and found that both
Bruner and Friedlander are correct. In most of these studies, subjects learned
to solve various probability problems, which could be represented in equation
form (e.g., binomial probability, Bayes' theorem). Egan and Greeno (1973) had
subjects solve problems and generalize with few instructions (the discovery
group), or had subjects solve problems and gave them instructions (the
expository, or rule, group). Before training, all subjects were given pre-tests
to test their knowledge of probabilistic concepts and their ability to solve
computations and use permutation strategies. Subjects were tested after
training. Measures of performance were number of errors and speed of solution.
There were no overall differences between the discovery and rule groups on the
post-test. However, subjects in the rule group performed uniformly well on the
post-test, but subjects in the discovery group performed better on the post-test
if they had higher ability (as measured by the pre-tests). Also, low ability
[...] Egan and Greeno postulated that the discovery subjects were forced to
invoke past knowledge in order to solve the problems, and subjects with greater
ability were better able to do this. On the other hand, subjects in the rule
group were adding a piece of information which was relatively independent of
their past knowledge, so previous abilities and knowledge were less important in
determining their performance.


Other experiments (Mayer, 1974; Mayer & Greeno, 1972; Mayer, Stiehl, &
Greeno, 1975) used two expository training groups. In one group, general
concepts familiar to the subjects were used to teach subjects to solve the
probability problems, and thus their general knowledge was invoked in the
learning process. In the other group, subjects were given the appropriate
probability formula and practiced using this formula; the formula was a piece of
information which subjects did not need to integrate with past knowledge. The
results were the same as for the discovery and rule groups in the Egan and
Greeno (1973) experiment. Previous ability was important for the performance of
subjects in the general concept group, and ability was not significantly related
to performance for subjects in the rule group. They also found that subjects
trained with the formula performed better than subjects trained with general
concepts on near transfer problems (problems similar to training problems), but
subjects in the general concept group performed better on far transfer problems
(problems requiring interpretation in order to be able to use the information
gained in training). They found that these differences in transfer ability
occurred early in learning and remained consistent throughout greater levels of
training. Mayer (1974) also found that subjects performed as well with a closed
book test (notes not available) as they did with an open book test (notes
available). Thus, the knowledge necessary to solve the problem is stored in
memory, but subjects seem to have difficulty in knowing when to apply this
knowledge. Subjects who were taught general concepts were better at applying
their knowledge to conceptual problems and sub-problems of those solved in
training. Subjects who were taught the formula were better at applying their
knowledge to problems similar to those solved in training.

In these studies, and in most of the relevant literature reviewed, tests
were given the same day as, or soon after, the training period. Few long-term
effects due to discovery and expository training have been studied. Two
exceptions are studies by Solter and Mayer (1978), who studied children learning
the concept of one-to-one correspondence, and Singer and Pease (1975), who
studied the learning of a serial motor task. The same general pattern emerged
from both studies. Subjects in discovery and expository groups were trained to
the same criterion, and given a transfer test at 1 and 3 weeks (Solter & Mayer)
or after 2 days (Singer & Pease). Discovery subjects performed better in the
first transfer test than did guided subjects, but this difference disappeared on
the next similar transfer test. The elimination of this difference between the
groups after one test seems odd if, as Mayer and his colleagues claimed, and as
the concept of permastore implies, different types of training cause different
types of cognitive structures to form. However, it may be that subjects do
initially have different structures, but one test in which subjects must
discover the solution is enough to get the guided group to invoke their past
knowledge, as the discovery group had done in training.

McDaniel and Schlager (1985) studied discovery and expository learning in
the domains of water-jar and river-crossing problems. They found that
strategies can be learned by discovery training. Discovery subjects solved
transfer problems faster than did expository subjects, but only if the same kind
of information (e.g., general strategies or procedures) had to be discovered in
transfer as was discovered in training. Knowing when to activate past relevant
knowledge seemed to aid in transfer. Discovery subjects seemed to have learned
how to search efficiently and where to stop a search.

The apparent positive effect of discovery on transfer performance is
important, but perhaps not as important as another effect of discovery. Many
researchers have noted that subjects trained by discovery are highly motivated
to learn, whereas subjects trained expositorily find the task uninteresting and
monotonous. Kersh (1958) reported that some discovery subjects later tested
their friends on the material they learned or went to the library to look up
further information. Most expository subjects, on the other hand, complained
that the task was boring, and they did not try to remember the rules because no
one told them to do so. Not surprisingly, discovery subjects retained the rules
they learned better than did the expository subjects.

In conclusion, it seems that discovery learning is beneficial in its
motivating effects and in getting learners to integrate new information with
their past knowledge, which aids in transfer. But subjects must first know the
prerequisite concepts. However, if the learner needs to use only the specific
knowledge learned in training (e.g., using a certain equation), then expository
training is more efficient, and application of that knowledge will be more
efficient. There is no strong evidence that either type of learning causes
better retention of information; long-term studies are necessary to determine
what remains of the information learned by each method.

Decay  of  Skills 

A review of the relevant psychological literature reveals that there has
been a great deal of work on the acquisition of skills, but relatively little on
the maintenance of skills once they have been acquired. For example, much is
known about the acquisition of simple short-term motor skills, of both a
discrete (e.g., placing small cylindrical blocks into holes; Kimble & Bilodeau,
1949) and continuous (e.g., pursuit rotor; Adams, 1968) nature; about the
development of long-term mnemonic strategies, such as memorizing lengthy
sequences of digits (see, e.g., Ericsson & Chase, 1982); and, from a more
practical standpoint, about learning information in the classroom, such as how
to solve mathematics problems (see, e.g., Schoenfeld, 1979).

Although there is considerably less in the literature on skill decay, there
are several recent studies which provide important insights into the question of
how to counteract decay, both during and after skill acquisition. One study by
Geoffrey Loftus (1985) addressed the crucial preliminary issue of how to measure
decay and how to compare decay rates for differing degrees of original
acquisition (although the retention intervals considered by Loftus were not
particularly long). More specifically, Loftus reviewed work by Slamecka and
McElree (1983) in which subjects learned verbal material to differing degrees of
acquisition. Subjects were later tested after acquisition at retention
intervals varying from 0 to 5 days. Slamecka and McElree found that the degree
of acquisition did not interact with retention interval (so that the difference
in performance levels on the items learned to differing degrees of acquisition
was as great after long retention intervals as after short retention intervals),
and they concluded that decay was independent of degree of acquisition. Loftus,
on the other hand, examined the same data from a different perspective and
reached the opposite conclusion. He assessed how much decay time is required
for performance to fall from any given level to some lower level, and he
diagnosed decay rates as different whenever such decay times differ. Since he
found that it took less time for a drop from one level of performance to a lower
level of performance when the initial acquisition degree was lower (the amount
of overlearning was less), he concluded that decay is slower under high
acquisition conditions than it is under low acquisition conditions. This
hypothesis was originally proposed by Jost, who formulated the following law:
"If two associations are now of equal strength but of different ages, the older
one will lose strength more slowly with the further passage of time" (Woodworth
& Schlosberg, 1954, p. 730). This work by Loftus not only provides important
measurement tools for assessing decay rate, but it also reaffirms the important
rule that skills can be acquired to differing degrees and those differences will
influence the extent to which the skills are maintained.
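
The contrast between the two readings of the same data can be sketched
numerically. The example below is a minimal illustration, not Loftus's actual
analysis: the retention values are invented, the function names are
hypothetical, and linear interpolation between test points is an assumption
made only to keep the sketch short. The two curves are separated by a constant
vertical gap (no interaction of acquisition degree with retention interval),
yet the time needed to fall between two fixed performance levels differs.

```python
def time_to_reach(times, scores, level):
    """Interpolate the time at which a declining curve first reaches `level`."""
    for i in range(len(times) - 1):
        hi, lo = scores[i], scores[i + 1]
        if hi >= level >= lo:
            if hi == lo:
                return times[i]
            frac = (hi - level) / (hi - lo)
            return times[i] + frac * (times[i + 1] - times[i])
    raise ValueError("level is outside the range of the curve")


def decay_time(times, scores, start_level, end_level):
    """Time needed for performance to fall from start_level to end_level."""
    return (time_to_reach(times, scores, end_level)
            - time_to_reach(times, scores, start_level))


# Hypothetical data: proportion recalled at retention intervals of 0-5 days.
days = [0, 1, 2, 3, 4, 5]
low_acquisition = [0.60, 0.45, 0.36, 0.30, 0.26, 0.23]
# Same curve shifted up by a constant, so the vertical gap never changes.
high_acquisition = [score + 0.20 for score in low_acquisition]

low_time = decay_time(days, low_acquisition, 0.60, 0.45)
high_time = decay_time(days, high_acquisition, 0.60, 0.45)
print(f"low acquisition:  {low_time:.2f} days to fall from 0.60 to 0.45")
print(f"high acquisition: {high_time:.2f} days to fall from 0.60 to 0.45")
```

With these invented values the vertical comparison shows no difference in
decay, while the decay-time comparison shows the high acquisition curve taking
longer to fall between the same two levels, which is the pattern the decay-time
measure treats as slower decay.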

Proposed  Theoretical  Mechanisms  Underlying  Skill  Retention 


Automatism 

Our discussion of proposed theoretical mechanisms underlying skill
maintenance will focus in part on a conjecture that follows from recent studies
reported in the psychological literature, even though these studies have not
directly addressed the issue of how to maintain skills. This conjecture is that
maintenance of a skill will depend crucially on whether that skill is automatic;
i.e., can be performed without conscious awareness. In particular, we propose
that automatic skills will persist in time with little need for rehearsal or
refreshing. This conjecture derives from recent research in cognitive
psychology which has drawn an important distinction between automatic and
controlled processes (e.g., Schneider & Shiffrin, 1977). Skills that require
only minimal cognitive capacity and attention to perform are either fully or
partly automatic; those that require resources and cognitive effort involve
controlled processes. The classification of a skill as automatic or controlled
depends in large part on the degree of prior skill acquisition. Many skills
employ controlled processes during the initial stages of acquisition, but these
processes become automatic with extensive practice. Consider, for example, the
skills involved in riding a bicycle. Much attention and effort are expended by
those initially learning this task; learners must concentrate on every movement
they make in order to keep on track and avoid falling down. However, after
considerable practice, those riding bicycles can turn their thoughts and
attention to other activities, such as the scenery or conversation (see Reed,
1982, pp. 50-51).

In order to test our conjecture concerning the persistence of automatic
skills, we need a clear definition of and a set of criteria for automatism.
Although the concept of automatism has been widely used by psychologists in
recent years, there has been considerable debate concerning its defining
attributes. Three prominent lines of research are particularly relevant to this
issue. The first line derives from the seminal work of Schneider and Shiffrin
(1977; Shiffrin & Schneider, 1977). This work has largely been focused on the
role of attention in simple perceptual tasks like target detection. The second
line derives from the influential work by LaBerge and Samuels (1974), which has
been addressed in large part to the development of automatic subprocesses in the
complex task of reading text. The third line of research was initiated by
Hasher and Zacks (1979) and is centered on memory activities. Different, though
overlapping, methods and criteria for automatism have been developed by each of
these sets of researchers.

Schneider and Shiffrin. After providing a review of the recent studies
using their approach to automatism, Schneider, Dumais, and Shiffrin (1984)
proposed a two-part definition for automatic processes: First, the process must
not make use of or deplete the general, non-specific cognitive resources.
Second, the process must be carried out in response to the presence of the
relevant external stimuli even when subjects attempt to ignore the stimuli. In
other words, capacity reductions do not affect automatic processing, and
automatic processes are not subject to conscious control. How do automatic
processes develop? Extensive practice is needed to ensure automatism, but not
all types of practice are sufficient. Consistency of practice is the major
factor. Schneider and Shiffrin compared two types of practice situations
differing only in the amount of consistency in training; with "consistent
mapping," the subject makes the same response each time a particular stimulus or
a particular class of stimuli occurs, whereas with "varied mapping," the
responses to stimuli change across training trials.

One paradigm used extensively by Schneider and Shiffrin (1977) to compare
consistent and varied mapping was the "multiple frame visual search" task. In
this task, subjects are presented a series of frames successively, one after
another with virtually no delay between frames. Each frame is presented very
briefly for a duration called the "frame time." Before a trial, or sequence of
frames, is started, the subjects are given a set of items, called the "memory
set," and are told to make a "yes" response to any item from the memory set that
occurs in the following sequence of frames. In the experiments reported by
Schneider and Shiffrin, each trial consisted of the presentation of 20
successive frames to which the subject was to respond "yes" or "no" depending on
whether or not the sequence included the presence of a memory set item; the
dependent variable was detection accuracy (the percentage of hits and false
alarm responses); and the independent variables included frame time, frame size
(the number of characters occurring in each frame, typically ranging from one to
four), the size or number of items included in the memory set, and (most
crucially) the type of mapping, consistent or varied. For example, in one
consistent mapping condition, the memory set items were always digits which
occurred embedded within letters, whereas in the analogous varied mapping
condition, the memory set consisted of some random subset of the letters. The
differences between the two mapping conditions were striking: Subjects required
a frame time of 120 msec or less to achieve a hit rate of approximately 95% in
the consistent mapping condition, whereas even frame times as long as 800 msec
did not always lead to such a high level of performance in the varied mapping
condition. Further, performance in the varied mapping condition was strongly
influenced by frame size and memory set size, but the effects of those variables
on consistent mapping were minimal. Hence, consistent mapping, but not varied
mapping, satisfied a specific rendition of the first part of the definition of
automatism: Changes in attention and memory load had essentially no effect on
detection accuracy.
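
The difference between the two mapping conditions can be illustrated with a
small trial generator. This is a hypothetical sketch rather than a
reconstruction of the original procedure: the function name, the 50% target
probability, the use of uppercase letters, and the placement of a single target
per trial are all assumptions made for illustration. Only the 20 frames per
trial, the frame size of one to four characters, and the digits-among-letters
versus letters-among-letters contrast come from the description above.

```python
import random
import string

DIGITS = list(string.digits)
LETTERS = list(string.ascii_uppercase)


def make_trial(mapping, memory_set_size, frame_size, n_frames=20, rng=random):
    """Build one hypothetical multiple-frame search trial.

    Returns (memory_set, frames, target_present).
    """
    if mapping == "consistent":
        # Targets are always digits; distractors are always letters.
        memory_set = rng.sample(DIGITS, memory_set_size)
        distractor_pool = LETTERS
    elif mapping == "varied":
        # Targets and distractors both come from the letters, so a letter
        # that is a target on one trial can be a distractor on the next.
        memory_set = rng.sample(LETTERS, memory_set_size)
        distractor_pool = [c for c in LETTERS if c not in memory_set]
    else:
        raise ValueError("mapping must be 'consistent' or 'varied'")

    frames = [rng.sample(distractor_pool, frame_size) for _ in range(n_frames)]
    target_present = rng.random() < 0.5  # assumed target probability
    if target_present:
        # Place one memory-set item at a random position in a random frame.
        frame = rng.randrange(n_frames)
        frames[frame][rng.randrange(frame_size)] = rng.choice(memory_set)
    return memory_set, frames, target_present


rng = random.Random(0)
memory_set, frames, target_present = make_trial("varied", 2, 4, rng=rng)
print(memory_set, target_present, frames[:2])
```

In the consistent condition a given character always calls for the same
response across trials; in the varied condition the same letter may be a target
on one trial and a distractor on another, which is the property held
responsible for the performance difference.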

LaBerge and Samuels. LaBerge and Samuels (1974) proposed a theory of
reading based on the automatization of its components. Reading is assumed to
proceed in a series of hierarchical stages beginning with the identification of
individual letters, combining letters into words, activating the meaning of the
words, and combining these meanings into successively larger groups of
comprehended discourse. Because attention is limited, reading skill is
constrained by the degree to which the "lower level" components are automated.
When children are first learning to read, their attention is directed toward
identifying letters and combining them into words. Through practice, these
components become automated, and attention can be focused on identifying the
words as units. Through further practice, successively higher levels of
processing become automated up to the point where attention can be focused
solely on the gist of the text. The readers also have the ability to focus
their attention on lower levels of processing when, for example, a high degree
of accuracy is required or they are reading aloud. In support of their theory,
LaBerge and Samuels (1974) showed that performance on identification and
matching tasks benefited from practice for novel letters but not for familiar
letters; presumably this processing stage had become automated for familiar
letters, but initially required attention and then became automated for the
novel letters.

This theory of automatism represents somewhat of a departure from that of
Shiffrin and Schneider (1977). Although it assumes that some components of
reading are truly automated, it also allows for the flexibility to focus
attention on an automatic component if the task requires it.

One important consequence of automatism, as stressed by LaBerge and Samuels
(1974) in their work on reading, is that the size of the processing units
increases as tasks become automatic. Much research has been conducted in recent
years that employs a simple detection task to investigate the size and nature of
the processing units used when reading printed text. These investigations rest
on the assumption that once individuals have abstracted a unit, they no longer
concern themselves with its constituent parts. The detection task is thus used
to indicate the size of an individual's processing units by revealing which
constituents are ignored by the individual. For example, in one version of this
task, subjects have been asked to read a passage of text and circle every
instance of a given target letter (e.g., the letter "t"). It has been found that
more errors are made on very common words, like the word "the," than on rare
words (Healy, 1976). These results have been explained by a
unitization model (see Drewnowski & Healy, 1977; Healy, 1980; Healy &
Drewnowski, 1983), according to which subjects miss letters on familiar words
like "the" because they process such words automatically in units larger than
the letter without completing processing at the letter level.

Hasher and Zacks. Hasher and Zacks (1979) proposed that memory processes
vary along a continuum of the amount of limited-capacity attentional resources
these processes consume. At one end of the continuum are purely automatic
processes. These processes require virtually no attentional capacity and can
therefore be performed in parallel with other processes. These skills are
unaffected by contextual variables such as arousal levels, and are minimally
changed by development or practice. Examples of such automated processes are
the encoding of spatial, temporal, and frequency information. Processes at this
end of the continuum are presumed to be innate, whereas some less polarized
automatic skills can be acquired through practice, such as the encoding of the
meaning of a word. At the other end of the attention consumption continuum are
"effortful" processes. These tasks require a great deal of attentional
resources and are likely to interfere with other resource-consuming activities
operating in parallel. Effortful processes are assumed to be influenced by
contextual factors and practice. Examples of effortful processes are rehearsal
and mnemonic elaboration.

Hasher and Zacks (1979) used this framework to account for the effects of
instruction, practice, interference, arousal, and development on memory tasks
which are predicted to be either automatic or effortful. The predictions are,
in general, consistent with the findings cited by Hasher and Zacks. One notable
inconsistency is an aging difference in memory for frequency, a presumably
automatic process.

This framework makes some interesting predictions about the maintenance of
skills. For example, automated components of a skill which were not developed
through practice should not be expected to deteriorate when the skill is not
used. It should therefore be unnecessary to include them in a maintenance
program. Additionally, less-polarized automated components developed through
practice would be expected to show a strong benefit from a maintenance program.
This is predicted because practice should help to ensure that the process does
not return to an effortful state and thereby reduce the cognitive capacity
available for other ongoing operations.

Assessing automatism. After reviewing much of the research on automatism,
Jonides, Naveh-Benjamin, and Palmer (1985) proposed two principles that should
be followed in the study of automatism in cognitive processes. The first of
these principles was that the concept of automatism is best applied to component
processes of a complex task, not to the task as a whole. The concerns here are
twofold. One, in judging a task to be automatic, the experimenter might
overlook some component processes that are not themselves automatic.
Conversely, the second concern is that by judging a task not to be automatic,
the experimenter may overlook component processes that are, by themselves,
automatic. To avoid these pitfalls, Jonides et al. suggested constructing
sensitive tests utilizing an explicit model of the processes involved in the
task.


The second principle Jonides et al. proposed was that the criteria chosen
for the evaluation of automatism should be motivated by the processes involved
in the task, not by individual criteria (such as minimal attentional demands and
lack of voluntary control) attributed to generic automatic tasks. The example
used to fortify this principle comes from the research comparing maintenance and
elaborative rehearsal (Naveh-Benjamin & Jonides, 1984). Three criteria were
used to assess the automatism of both forms of rehearsal: demand on capacity,
susceptibility to interruption, and stereotype. The importance of listing these
criteria is that though stereotype is atypical of the criteria that usually
appear in the automatism literature, it proved to be particularly effective in
determining the relative automatism of elaborative and maintenance rehearsal
methods.

The conclusions drawn are that before an automatism study is conducted, an
explicit model of the component processes of a task should be formulated and that
once formulated, the criteria for automatism should be relevant to those
specific component processes, not tasks in general.

Central to the study of automatism through attention is the belief that the
human is a limited capacity processor. That is, there is a limited amount of
resources from which control processes can draw in attempting to perform a task.
Automatic-control processing theory assumes that capacity limitations arise out
of competition between concurrent control processes. Hence, when concurrent
performance of tasks causes the control process demands to exceed the capacity
limitations, there is a decrement in performance in one or both of the tasks due
to lack of resources. In contrast to control processes, automatic processes can
occur in parallel, free from the limitations of control processes, without any
noticeable decrement in performance (Wickens, 1980). A middle ground approach
to this control-automatic dichotomy proposes that as a task becomes better and
better learned, its resource (attentional) demands gradually decline until
ultimately demands are minimal and the task is assumed to be automatic (Logan,
1979).


With capacity theory (e.g., Kahneman, 1973) in hand, the concern switches
to the measurement of attention. The paradigm most used to measure the resource
utilization of a task is the dual-task paradigm (e.g., Posner & Boies, 1971).
The dual-task paradigm permits the comparison of two tasks in terms of common
units. This comparison is achieved through the use of a subsidiary task setup.
In effect, the subjects are given two tasks, a primary task which they are
instructed to do as well as they can and a secondary task which they are to
attempt only after successfully completing the primary task. In this method,
the measure of attention required for the primary task is inversely related to
the performance of the secondary task. As performance increases in the
secondary task, it is presumed that the attentional demand for the first task is
decreasing. Conversely, should performance on the secondary task be low, the
attentional demand of the primary task is presumed to be high. An example of a
dual-task setup would be to have the subject read a passage while simultaneously
detecting letters (Proctor & Healy, 1985) or read a passage while simultaneously
transcribing dictation (Spelke, Hirst, & Neisser, 1976).
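
The inverse relation described above can be illustrated with a short sketch. It is not taken from the studies cited. It assumes, purely for illustration, that the secondary task is also measured alone (a single-task baseline, which the text does not specify) and that the proportional drop from that baseline serves as a rough index of the primary task's attentional demand. The function names and scores are hypothetical.

```python
from typing import Dict, List


def secondary_task_decrement(baseline_score: float, dual_score: float) -> float:
    """Proportional drop in secondary-task performance relative to doing it alone.

    baseline_score: secondary-task score with no primary task (assumed measure).
    dual_score: secondary-task score while the primary task is being performed.
    """
    if baseline_score <= 0:
        raise ValueError("baseline_score must be positive")
    return (baseline_score - dual_score) / baseline_score


def rank_by_inferred_demand(baseline_score: float,
                            dual_scores: Dict[str, float]) -> List[str]:
    """Order primary tasks from highest to lowest inferred attentional demand.

    A larger secondary-task decrement is read as a larger demand of the
    primary task, following the inverse relation in the text.
    """
    decrements = {
        task: secondary_task_decrement(baseline_score, score)
        for task, score in dual_scores.items()
    }
    return sorted(decrements, key=decrements.get, reverse=True)


if __name__ == "__main__":
    # Hypothetical letter-detection scores (items detected), not real data.
    scores = {"practiced primary task": 36.0, "novice primary task": 20.0}
    print(rank_by_inferred_demand(40.0, scores))
```

Under these assumptions, the primary task that leaves less room for the secondary task is ranked as the more attention-demanding one. The ranking is only as good as the comparability of the primary tasks being compared.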


Finally, caution should be taken in comparing primary tasks in a dual-task
paradigm, as the disruption of the secondary task depends not only on the
difficulty of the primary task but also on its structure. Secondary tasks may
interfere differently with differing primary tasks, as would be the case in the
comparison of a task with a high motor load to a task with a high conceptual or
perceptual load. Perhaps the safe road to take would be a modification of the
dual-task paradigm, where a battery of secondary tasks is used instead of just
one such task.

Alternatives to Automatism

Spelke, Hirst, and Neisser. Although the notion of automatism has been
widely accepted, some data have suggested the need for alternative approaches.
Spelke, Hirst, and Neisser (1976), for example, trained two people to copy
dictated words while reading stories. After considerable practice, the
participants were able to do both tasks as well simultaneously as they could do
them alone. Subsequent experiments provided evidence against the hypothesis
that participants were alternating their attention between the two tasks.
Within the same paradigm, Hirst, Spelke, Reaves, Caharack, and Neisser (1980)
showed that redundancy of the reading material did not influence the time
necessary to reach criterion performance on the dual task and that transfer from
more to less redundant material was equivalent to transfer from less to more
redundant material. If participants were alternating their attention between
the reading and the writing tasks, then the more redundant reading material
should have facilitated this process and thus increased dual-task performance.
Additionally, if participants learned to exploit the redundancy of the text,
then their performance should have dropped when they transferred to the less
redundant material. Hirst et al. also provided findings which they interpreted
as evidence against the hypothesis that the word-copying task becomes automated
and thus does not involve attentional resources. They found that fewer copying
errors were made when the to-be-copied materials were sentences relative to
random word strings and that recognition memory for integrated sentences
indicated that the participants had remembered implications of the sentences.
These results strongly suggest that the participants understood the sentences
they copied and therefore the copying task required attention and was not
automatic. These researchers concluded that the notion of a fixed attentional
capacity is incorrect; with sufficient practice, attention can be divided
without automatism or the loss of conscious control.

Salthouse. A very similar argument has been made based on the performance
of skilled typists. Salthouse (1984) and others have argued that skilled typing
is the result of learning to overlap the performing of its component processes
(see the cascade model of McClelland, 1979, for a similar idea). Evidence for
overlapping comes from the finding that typing speed is greatly decreased when
the size of a preview text is reduced below about eight characters and reaches
the speed of a choice reaction time task with a preview window of only one
character. It is easy to see how reducing the size of the preview window would
impede the overlapping of component processes, but this manipulation should have
been inconsequential if the typists were performing each of the component
processes sequentially. The finding from eye movement studies of the eye-hand
span (i.e., typists fixate several characters ahead of the one being typed;
Butsch, 1932) provides convergent evidence for the overlapping processes notion.
Salthouse showed that typing skill is positively correlated with eye-hand span,
suggesting again that typing skill is related to the degree to which component
processes can occur in parallel.

Although different typing processes can presumably occur simultaneously,
there is good evidence that they are not automatic. For example, typing errors
are detected almost immediately, as shown by an increased latency to press a key
following an error (Salthouse, 1984; Shaffer, 1976) and by the fact that a key is
pressed with less force when it is an error (Rabbitt, 1978; Wells, 1916).
Additional evidence against the claim that typing is automatic is the finding
that when participants were told to stop typing whenever they heard a tone, they
typically stopped within one or two letters rather than typing to the end of the
word (Logan, 1982).

The results from the study of skilled typing are consistent with the
dual-task studies described above. Through practice, multiple tasks or multiple
components of a task can be performed in parallel without the loss of conscious
control. These results are at some level inconsistent with (or at least suggest
limitations of) the theories of attention proposed by Shiffrin and Schneider
(1977) and Hasher and Zacks (1979). Explanations for the typing results have
been proposed which claim that a process can be automatic but still available to
conscious control. However, such explanations can be held only at the expense
of making the definition of automatism more vague and less susceptible to
experimental tests.

Salthouse (1984) also provided data which make a simple component
explanation of typing difficult. For example, choice reaction time, presumably
a component of typing, is uncorrelated with typing skill but is known to be
slower for older people. However, old and young typists are equivalent in
typing speed. It appears, then, either that choice reaction time is not a
component of typing or that older typists are able to compensate in some way for
a deficit in the reaction time component. Regardless of which explanation of
this paradox is true, this finding suggests that examining component processes
out of context may yield misleading information concerning performance on the
global task.


Cheng. As an alternative to automatism, Cheng (1985) provided an
interpretation of skill improvement based on task restructuring. She argued
that through practice, the components of a task can be reorganized such that
they operate in a more coherent, integrated manner. Task performance can be
improved by replacing less efficient modes of processing with more efficient
ones. In other words, one can learn to perform a task in a new way, rather than
learning to do the old way faster or more automatically. As an example, Cheng
contrasted the difference between solving the arithmetic problem of finding the
sum of ten 2s by addition and by multiplication. Although the solution can be
obtained by either method, clearly multiplication is more efficient. Indeed,
students learn how to restructure the addition problem into a multiplication
problem, rather than learning to do numerous addition operations automatically.
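
The arithmetic example can be made concrete with a small sketch that counts the operations each strategy needs. This is only an illustration of the restructuring idea, not a model proposed by Cheng. The function names and the operation-counting convention (one count per addition or multiplication) are assumptions introduced here.

```python
from typing import Tuple


def sum_by_repeated_addition(value: int, count: int) -> Tuple[int, int]:
    """Sum `count` copies of `value` one addition at a time.

    Returns (result, number_of_additions). Summing ten 2s this way
    takes nine additions, however quickly each one is carried out.
    """
    if count < 1:
        raise ValueError("count must be at least 1")
    total = value
    additions = 0
    for _ in range(count - 1):
        total += value
        additions += 1
    return total, additions


def sum_by_multiplication(value: int, count: int) -> Tuple[int, int]:
    """Restructured version of the same problem: a single multiplication.

    Returns (result, number_of_multiplications).
    """
    if count < 1:
        raise ValueError("count must be at least 1")
    return value * count, 1


if __name__ == "__main__":
    print(sum_by_repeated_addition(2, 10))  # (20, 9)
    print(sum_by_multiplication(2, 10))     # (20, 1)
```

Both strategies give the same answer. Speeding up each addition leaves the number of steps unchanged, whereas restructuring the problem changes the number of steps itself.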

This  account  of  skill  improvement  is  appealing  and  is  consistent  with  the 
theorizing  on  skilled  typing  (Salthouse,  1984)  and  dual-task  performance  (Hirst 
et al., 1980). The relationship between automatism and restructuring may be
best thought of as supplementary rather than contradictory, since both processes
are  probably  operating  to  some  extent  in  the  acquisition  of  all  skilled 
behavior. 

Anderson. Still another important alternative approach to the problem of
skill  acquisition  is  Anderson's  (1982)  approach  based  on  knowledge 
proceduralization. Anderson claimed that during the initial stage of skill
acquisition, knowledge is held in declarative form. To be used in this
form, it must be retrieved from long-term memory and held active in working
memory. Consequently, the use of declarative knowledge is associated with slow
retrieval times and the inhibition of concurrent operations which require
working  memory  capacity. 

Through practice, declarative knowledge is compiled or proceduralized.
Proceduralized knowledge does not have to be entered into working memory to be
acted upon. Consequently, it can be retrieved quickly and its use requires
virtually no working memory capacity. Whereas declarative knowledge is flexible
and can be modified to remove incorrect information, proceduralized knowledge is
much more difficult to change. After a skill has been proceduralized, further
learning and improvement are accomplished by fine-tuning the application of the
operations.

Anderson's theory provides insight into the utility of the skill
acquisition process. When a skill is first being acquired, much of what is
gained involves learning to perform the skill correctly before learning to
perform it quickly or more efficiently. At this stage, it is advantageous that
the task relies on working memory, thus enabling the close monitoring and
adjustment of skill knowledge. Once the task can be performed correctly, the
knowledge does not need to be changed and it can be proceduralized to
crystallize it and increase the efficiency of its operation.

This  model  is  similar  to  Cheng's  (1985)  restructuring  hypothesis  in  that 
skill  improvement  is  seen  as  the  result  of  developing  new,  more  efficient 
operations. It is also similar to automatization theories (e.g., Shiffrin &
Schneider, 1977; Hasher & Zacks, 1979) in that both approaches stress that an
important difference between a proceduralized (automatic) skill and a
nonproceduralized (controlled) skill is the gain in speed and efficiency at the
cost of a loss in control and flexibility. Both theories are also similar in
assuming that working memory is strongly involved in learning a new skill and
its role is greatly diminished as the skill becomes well learned.

Additional Theoretical Mechanisms


Levels  of  processing  and  transfer-appropriate  processing.  Levels  of 
processing  theories  attempt  to  explain  how  different  types  of  processing 
performed on stimuli affect the retention of those stimuli. The original levels
of processing framework (Craik & Lockhart, 1972) attempted to explain
differences in retention of words based on the kind of processing performed on
those  words.  Words  which  were  processed  at  a  "deep"  (more  semantic)  level  were 
better  remembered  than  words  which  were  processed  at  a  "shallow"  (structural  or 
phonemic)  level.  Craik  and  Lockhart  hypothesized  that  depth  of  processing  could 
be  indexed  by  the  amount  of  time  the  subject  spent  processing  the  word,  with 
deeper  levels  of  processing  taking  more  time.  Craik  and  Tulving  (1975), 
however, found that in a complex structural task, subjects took longer to
process a word than in a semantic task, but words processed semantically were
still  better  remembered.  They  attempted  to  find  an  alternative  measure  of  depth 
of  processing.  It  was  discovered  that  subjects  remembered  the  words  to  which 
they  had  responded  "yes"  in  the  learning  task  better  than  words  to  which  they 
had  responded  "no."  The  hypothesis  was  that  words  which  fit  the  context  in 
which  they  were  learned  (words  responded  to  with  "yes")  were  more  tightly 
integrated with that context than were words which did not fit the context.
When the word fit the context, the context could serve as a retrieval cue for
that  word.  Craik  and  Tulving  proposed  that  a  process  of  elaboration,  in  which 
the  word  is  tightly  integrated  with  the  learning  context  or  the  event  is 
specified  more  uniquely,  could  explain  the  retention  results.  They  did  not, 
however, specify how to measure the level of elaboration, so this theory did
not allow retention to be predicted.

There have been many other criticisms, besides lack of predictability, of
the  depth  of  processing  framework  (see  Baddeley,  1978,  for  a  review  of  some  of 
these  criticisms).  Morris,  Bransford,  and  Franks  (1977)  showed  that  if  an 
appropriate  test  were  used,  a  supposedly  shallow  level  of  processing  would 
produce better retention than would a semantic level of processing (on this
test).  In  particular,  they  found  that  subjects  trained  with  a  rhyming  task 
showed  better  retention  on  a  rhyming  test  than  those  trained  with  a  semantic 
task (filling in a word in a sentence frame). They hypothesized that the
previous benefits of semantic processing over "shallower" processing were found
because  subjects  were  given  a  semantically  oriented  test.  Stein  (1978)  found 
results  similar  to  those  of  Morris  et  al.  by  examining  the  retention  of  semantic 
versus  letter  case  information.  Subjects  given  case  questions  during 
acquisition performed better on a subsequent case recognition test than did
subjects given semantic questions during acquisition. Also, subjects given
semantic  acquisition  questions  performed  better  on  a  semantic  recognition  test 
than  did  subjects  given  case  acquisition  questions.  Stein  concluded  that  the 
ability  to  remember  particular  aspects  of  an  input  depends  crucially  upon  how 
well  the  learner  was  able  to  detect  that  information  during  encoding  and  was 
able  to  encode  that  information  distinctly  from  other  aspects  of  the  input. 

It therefore seems that an important conclusion from this work is that the
form in which knowledge will be used should be foremost in determining how the
knowledge should be taught, in order for optimal retention to be demonstrated.
For example, if the knowledge will be used phonetically, it should be taught
phonetically. Within a particular "level" of training, the more integrated the
information is with the learning context, the more distinct the information can
be.

The generation effect. The generation effect refers to the observation
that information which is generated is better remembered than information which
is simply read. The typical experimental paradigm used to examine this effect
provides the subject with a stimulus item and either a rule for generating a
response item or the response item itself. The memorial advantage of item
generation has been obtained with a variety of generating tasks, such as
providing an opposite (McElroy & Slamecka, 1982), transposing letters (Gardiner
& Hampton, 1985), completing word fragments (Glisky & Rabinowitz, 1985), and
even providing the product of a multiplication problem (Gardiner & Rowley,
1984). Generation effects have been found with a number of memory tests, such
as free recall (Gardiner & Hampton, 1985), cued recall (Graf, 1980), and
recognition  (McElroy  &  Slamecka,  1982).  Further,  the  effect  holds  for  several 
types  of  verbal  material,  with  the  exception  of  nonword  letter  strings. 

The  generation  effect  is  clearly  a  robust  phenomenon,  but  a  satisfactory 
theoretical  account  of  it  has  not  yet  been  provided.  McElroy  and  Slamecka 
(1982)  broadly  divided  the  proposed  explanations  into  two  categories,  those 
which  point  to  the  involvement  of  semantic  memory  which  results  from  the  act  of 
generation  and  those  which  stress  the  inherent  difference  between  generating  and 
reading.  Basically,  the  former  type  of  explanation  predicts  that  a  generation 
task must involve semantic memory to provide a generation effect, and the latter
type  of  explanation  predicts  that  any  generation  task  will  result  in  a 
generation  effect. 

The "generate only" theory is contradicted by the lack of a generation
effect for nonwords. Because nonwords are not a part of semantic memory, this
finding has been taken as evidence for the "semantic memory" explanation.
However, a strong version of the semantic memory account, which states that the
generated item must be represented as a unit in the lexicon (mental dictionary),
is also problematic because the effect has been obtained with digits (Gardiner &
Hampton, 1985; Gardiner & Rowley, 1984) and word pairs (Gardiner & Hampton,
1985). In a recent series of experiments, Glisky and Rabinowitz (1985) showed
that performing a generation task at the time of test enhances the generation
effect if the item was generated during study but not if it was only read at
study. Additionally, they found that the enhancement could be obtained if the
same specific operation was repeated but not if the same generation rule was
applied in a different manner.

Regardless of which mechanism produces the generation effect, its potential
application for the retention of skills is evident. Unfortunately, no research
has been conducted to determine whether or not a generation effect can be
obtained at a long retention interval. If the generation effect can be extended
to  long-term  retention,  then  it  would  be  beneficial  to  incorporate  generation 
tasks  in  a  skill  training  program. 

The role of consciousness in skill retention. It is commonly assumed that
information retrieval and/or skill elicitation somehow involves consciousness or
awareness. This assumption derives largely from traditional studies of memory
involving recall or recognition primarily of verbal information encoded at some
earlier time. The assumption is not consistent with some recent results from
studies that have assessed performance with tests that are not closely tied to
any particular prior experiences (Tulving, Schacter, & Stark, 1982). In these
cases, subjects exhibit memory in the form of facilitated performance on a given
task but without any accompanying conscious recollection of the experiences that
contributed to that facilitation. For example, Cofer (1967), among others, has
reported that subjects are more successful on a word completion test (i.e., a
test on which the subject is presented with two or three letters of a word and
has to fill in the missing letters) when the target word has recently been
presented than when presented a new word. This facilitation of completion
performance has been called a direct priming effect.


Direct priming effects have also been demonstrated in word identification
tasks and lexical decision tasks (Graf & Schacter, 1985; Tulving, 1981). A
variety of constructs have been used to distinguish between the type of memory
that is tapped by priming tests, on the one hand, and by traditional recall and
recognition tests on the other. One distinction that seems particularly
appropriate is the distinction between episodic, semantic, and procedural memory
(Tulving, 1983). Almost all examples of episodic memory, requiring as they do a
reference to certain prior experiences, involve conscious, effortful
recollection. Procedural memory and semantic memory, in contrast, are often
revealed without any apparent conscious effort to retrieve. According to
Mandler (1980), effortless, automatic memory is based on the activation of
specific pre-existing memory representations. Study materials used in many
memory experiments are familiar, individual words, already represented in
long-term, semantic memory prior to their appearance in the study list. Mandler
argued that these pre-existing representations are activated as a result of
presenting the study items, and that activation occurs automatically and thus
independently of the processes that mediate conscious remembering. Such an
interpretation is supported by studies of amnesic patients. Amnesic patients
are characterized by poor memory for recent events but relatively normal
retention of older knowledge and skill (Squire, Cohen, & Nadel, 1984). One of
the hallmarks of amnesia is a patient's inability to acquire and remember new
associations. Nevertheless, amnesic patients show relatively normal priming or
effortless retention when the study materials are familiar items that have a
pre-existing representation in memory. The finding
of normal priming effects, in conjunction with the observation that amnesic
patients  seem  unable  to  acquire  new  associations,  is  consistent  with  the  view 
that  effortless  memory  is  mediated  by  the  activation  of  pre-existing 
representations. Learning new episodes means there is no pre-existing memory
representation. Thus, activation cannot produce correct remembering. Further,
whatever the mechanism is for establishing new representations, it seems to be
defective  in  amnesic  patients.  But  priming  derives  from  already-existing 
representations  which  can  be  activated  automatically  to  produce  a  memory-like 
effect. 


We  take  these  studies  to  have  a  further  implication.  When  a  test  is 
arranged  for  a  skill  learned  earlier,  it  is  typically  found  that  skilled 
performance falls below the level achieved during original practice. This
suggests  that  either  (a)  the  skill  as  a  whole  or  all  of  its  components 
deteriorated,  in  some  sense,  over  time  without  exercise;  or  (b)  skill  components 
differ  in  their  persistence,  some  of  them  being  available  for  retrieval  in 
full-blown  form  at  a  later  time  while  others  have  significantly  decayed.  On  the 
basis  of  results  that  distinguish  between  automatic  and  effortful  retrieval  of 
items  or  components  from  memory,  we  suggest  the  hypothesis  that  some  components 
do indeed persist unaltered over time and can be activated by the reintroduction
of the appropriate stimulus to their mental representation. Other components,
in contrast, which are generally connected to the changed circumstances between
original  training  and  subsequent  test,  or  are  heavily  context  dependent,  might 
not  be  so  readily  available.  Wherever  a  component  lacks  mental  representation, 
for  whatever  reason,  automatic  activation  will  not  work.  The  subject  must 
engage in a conscious search of memory and possibly the establishment or
reestablishment of mental representation in support of that component.
Available evidence suggests that memory for motor skills, especially highly
integrated, continuous motor skills, is excellent because retrieval of the
appropriate representation tends to be effortless. Skill components tied to
particular episodes or experiences require effortful search and possible
re-encoding, with the implication that their performance may be relatively
deteriorated after prolonged periods of time.

Conclusions 

In conclusion, the literature shows that the long-term retention of
knowledge and skills is a complex multifaceted problem. The research conducted
to date on long-term retention has already uncovered some of the psychological
principles involved in its promotion. Further, investigations of the mechanisms
believed to mediate skill acquisition have provided principled accounts of how
and when permanent memory should occur. Additional investigations of such
mechanisms for very long-term memory will determine whether or not these
mechanisms  are  applicable  to  skill  retention.  It  is  our  belief  that  a 
principled  investigation  of  skill  maintenance  will  provide  useful  applications 
for  improving  retention  and  provide  a  major  contribution  to  both  the  military 
and  civilian  communities. 